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Routing for Reliable Manufacturing 

Ed P. Huijbregts, Hua Xue, and Jochen A. G. Jess 



Abstract— The impact of spot defects on the susceptibility for 
electrical failure of a net is analyzed. Based on this analysis, a 
general routing cost function is presented, in which the manu- 
facturability of a net is taken into account in conjunction with 
traditional routing objectives. The new cost function, relating the 
process spot defects to the routing procedure has been imple- 
mented. Failure probabilities are analyzed for the benchmark 
layouts obtained by our routing tool using both the original cost 
function and the new cost function. The results show that the 
failure probability of a layout is significantly decreased if the spot 
defect mechanism is taken into account in the routing procedure, 
while the area of the layout is kept constant 



I. Introduction 

ROUTING a net is a "classical" topic in CAD for VLSI. 
The problem can be formalized as the Minimum Steiner 
Tree problem in an appropriate routing graph 16]: 
Problem: minimum Steiner Tree, 

Instance: a connected undirected graph G = (V, E), also 
called routing graph, with edge cost function A : E — * R+ 
and a net N C V, consisting of vertices to be connected, 
Configurations: all edge-weighted trees, 
Solutions: all Steiner trees for N in G, denoted as E T , 
i.e., all trees of G connecting all vertices of N with all its 
leaves being vertices in TV, 
Minimize: A(T) = E ce i? T A(e). 

Many algorithms exist to solve the minimum Steiner tree 
problem, see [4] for an excellent overview. All of these 
algorithms will come up with significantly different routings if 
different cost functions are applied. Conventionally, the edge 
cost function A(e) is defined as the product of the distance 
d between two adjacent vertices and a control factor c, i.e., 
A(e) = cd. Parameter c is used to adjust the edge weights 
in or between the different mask layers. For example, by 
setting a larger value of c for the poly layer and a smaller 
value of c for the metal layer, connections with high signal 
propagation speed can be obtained instead of a real shortest 
path in distance. Furthermore, by choosing different values 
for c for different routing directions, thus favoring certain 
directions, a routing style can be imposed. Summarizing, the 
traditional cost function can affect a routing in three aspects: 
the net length, the performance, and the routing style. 
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As process feature size keeps decreasing and IC chips are 
becoming more complex, chips are more sensitive to process 
disturbance. Inductive Fault Analysis (IFA) [9] reveals that 
close nets are likely to get shorted because of spot defects, 
the main local disturbance in fabrication processes. Therefore, 
from the point of view of defect analysis, the yield of a 
good routing depends not only on the net itself, but also on 
the environment of the net. In other words, the minimization 
of the cost of a net in terms of the net length is not the 
optimal solution if the failure possibility of the net is taken 
into account. Obviously, the proposed cost function A(e) does 
not adequately cover this issue. 

The idea to relate the routing procedure to the process 
defects has been proposed in several papers [)], [2], [5], [8]. 
In [81, a channel router called Defect Tolerant Routing (DTR) 
was implemented to minimize the critical areas between the 
horizontal routing segments. Later on, the authors 11 J tried 
to minimize the critical areas between both the horizontal 
segments and the vertical segments by searching for valid gaps 
in routing channels. It has also been proposed that layout be 
modified in order to minimize via count and critical areas on 
each layer for two layer routing channels [5). In all papers, 
there are still quite a few drawbacks that make the routing 
results far from being effectively defect-tolerant. The main 
reasons are as follows. 

1) Only spot defects causing extra material (bridges) 
are considered. Consequently, when the probability 
of bridge faults decreases by minimizing the critical 
area for bridges, the probability of open faults, caused 
by missing material, likely increases. This is a valid 
assumption since routers generally try to minimize the 
net length, and therefore the probability of open faults 
is minimized. Modifying such a 'minimum net length/ 
layout in order to minimize the probability of bridge 
faults will usually result in longer net, and therefore the 
probability of opens increases. 

2) Only the single layer defect model is used for modeling 
the spot defects. The fact that in addition missing 
material or extra material between mask layers will give 
rise to more bridges or opens is not taken into account. 

3) The tradeoff between the increase of the number of vias 
(potential open sites) and the decrease of the critical area 
for bridges is not considered. 

4) Only one defect size is considered. However, spot de- 
fects are distributed with random sizes in reality. To 
accurately model spot defects, it is important to take 
into account the defect size distribution. 

In this paper, according to the defect size distribution and the 
process statistics, the failure probability of a net is analyzed. 
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Fig. I. (a) Faull type OE. (b) Fault type OM. pj g . 2. (a) Fault type IE. (b) Fault type IM. 



Based on the analysis, we propose a new edge cost function 
for the general routing problem. By applying the new cost 
function, a good tradeoff between the minimization of the net 
length and the minimization of the failure probability can be 
obtained for each net, which consequently leads to a better 
layout manufacturability. Part of this work has been published 
in [14]. 

II. Spot Defects 

The functional failure of a chip is likely caused by spot 
defects [12J. The result of a spot contamination in a process 
step is either extra material or missing material at the place 
where the spot occurs [7]. A spot defect may either occur in 
one layer of the silicon structure, such as the metal layer or the 
poly layer, or somewhere between two layers, where it causes 
extra or missing oxide. We classify spot defects as follows. 

1) One layer extra material defects (OE)'. The defects may 
cause bridges between connection patterns in the same 
layer. For example, the spot defect with size d in the 
metal layer will result in a bridge between nets I and 2 
as shown in Fig. 1(a). 

2) One layer missing material defects (OM): The defects 
will result in open faults if the spot defects break the 
connection patterns in one layer. Such a case where a 
spot defect breaks a net in the metal layer is shown 
in Fig. 1(b). If the defects cause missing via patterns, 
the open faults will also be induced because of missing 
vias. 

3) Inter-Layer extra oxide defects (IE): If the defects occur 
in the oxide at the location of vias, the vias may be 
blocked, thus leading to open faults. Fig. 2(a) shows an 
example where a via connecting metal 1 and metal 2 is 
broken by the spot defect. 

4) Inter-Layer missing oxide defects (IM): The defects-are 
also referred to as oxide pinholes. If the defects occur 
in the oxide between two overlapping conductors, the 
conductors are shorted. For instance, the pinhole in 
Fig. 2(b) causes a new via connecting metal 1 and metal 
2, and therefore results in a bridge. 

In the next section, assuming the above four types of spot 
defects to be the main random disturbance in the IC processes, 
we propose a formula to predict the probability of the failure 
of a net by taking into account the spot defect size distribution 
and the critical areas with respect to the various types of spot 
defects. Based on the formula, a routing strategy is suggested 
to minimize the probability of the failure of a net during the 
creation of the net. 



III. The Failure Probability of a Net 

Critical area, defined as the area in which the center of a 
defect must fall to cause a fault, can be extended to the critical 
area with respect to a particular object. The object can be any 
spot defect type. Suppose the spot defect size distribution for 
object 7; is D^x), and the critical area with respect to object 
t? is Ajjlx), where x is the spot defect size. If a uniform defect 
density P, } is assumed, then the probability of the failure of 
object ry, denoted as F n , can be expressed as: 

F„ = P„/ D n (x)A n {x)dx 

where rnin and max are the minimum and the maximum 
defect sizes. There have been many efforts on modeling the 
defect size distribution. In this paper, the size distribution 
function taken from [10] is assumed. In principle, using 
other size distribution functions will not affect the following 
discussions. By replacing function D v (x) with X$/x 3 < where 
X 0 is the peak defect size of the distribution, we obtain 

/max 1 
M*)-tdx. (I) 
lin 

As described in the previous section, the spot defects can be 
classified by four types. For each net, the critical area A v (x) 
with respect to the spot defects of type r? can be estimated 
by using the virtual artwork concepl proposed in [7]. Hence, 
the failure probability for each type of spot defect can be 
computed according to (1). 

Given a net A r , suppose the net length is /, and the net width 
and spacing are w and a, respectively. Assume 6 is the total 
length of the adjacent segments with the neighboring nets, and 
0 is the number of overlapping sites, i.e., the number of unit 
area overlaps with the conductors in the upper or lower layer 
as shown in Fig. 3. In addition, we suppose the number of 
vias on net N is v. 

1) Type OE: If the defect size x is smaller than ,s, then the 
defects will not cause any fault due to the zero critical 
area. If s < x < 2.s + u;, the critical area Aqe(x) is equal 
to (x - s)b. However, when the defect size is equal or 
larger than 2m + w, the critical area will be saturated to 
(ft + w)b. Consequently, the probability of the failure of 
defect type OE is 

Foe = P orA- 0 2 &(^ dx 

J2*+w •'' / 
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By setting max to oo, we obtain Foe = c*6 t with 

a= -r-{s-j27Tw))- (2) 

2) Type OM: When the defect size is smaller than ty, it is 
not possible that the net will be broken by the defect. 
Therefore, the critical area is zero. For the defects with 
size x,w < x<2w + s, the critical area /Iom(^) is 
equal to (x - w)l. As the defect size exceeds 2io + a, 
the critical area will be saturated to (to + s)l, similar 
to the defect type OE. Consequently the probability of 
the failure caused by the defects of type OM can be 
described as 

Fom = PomX&^J dx 

hw+H s 3 ) ' 
Similarly, we derive Fom = fth where 

2 U (2w + s)J- (3) 

3) Type IE: Since the defects of type IE will only break 
conductors traversing the oxide, i.e., vias, the probability 
of the failure caused by this type of defects is propor- 
tional to the number of vias on the net. It needs to be 
mentioned that the defects in oxide no longer follow 
the normal size distribution function. Here, we assume 
a simple model to estimate the failure probability. (Since 
the critical area for vias given defect size x is a function 
of x? y (1) will yield an infinite value for 7 if the size 
distribution function of [10] is used.) Suppose the size 
of a via is w x «/. The probability of failure can be 
estimated by F IE = 7", where 

7 = F TE ™ 2 . (4) 

4) Type IM\ The defects will cause parasitic vias between 
two layers of the silicon structure. However, the parasitic 
vias are functionally harmful only if the vias occur 
in places where two conductors overlap. As a result, 
the conductors are shorted by the pinhole defects. The 

~ overlap area can be treated as the critical area for the 
pinhole defects, assuming: 1 ) that a pinhole occurring in 
the overlap area will result in a bridge fault, and 2) there 
is no size distribution for pinhole defects. Therefore, the 
probability of failure caused by defects of type 1M can 
be estimated by Fim = bo y where 

t i = W. (5) 

According to the above analysis, the parameters 
A)E: Fom s Aej Fim, and A"o are process-related, while 
w and s are determined by the design rules. Since 
these parameters are independent to routers, the total 
probability of the failure F of net N can be given by 




Fig. 3. Explanation of parameters .% J>, /, o. 

summing up the probabilities of the failures caused by 
the different types of defects, i.e., 

F = Foe + F 0M + Fie + F £M 

= ab + 01 + yn + 6a (6) 

where a. 7, and 6 are given by the previous equations. 
It is obvious that the reduction of l>J, v, and o is an 
effective way to decrease the probability of the failure 
of net iV for a router. 

IV. New Cost Function 

Given a routing graph G = (V, E) with edge weights A(c). 
The cost of a net is defined as the sum of the cost of the 
edges of the Steiner tree that connects the terminal vertices. 
Let E T Q E denote the set of edges; then, the cost of a net 
is given by 

C=£A(e). (7) 

ceEr 

The goal is to find a minimum cost connection for each net. 
Wc combine the conventional cost function of (7) with the 
failure cost function of (6) according to 

<7 nr w = C + pF (8) 

In conventional routing algorithms, the goal is to achieve 
minimum total net length, implying minimum area. Thus, 
the conventional cost function is modeled as a minimum 
length cost function. In addition to the net length and the 
number of vias which are considered in conventional cost 
functions, the failure cost function introduces two new aspects, 
namely bridges and overlaps. In essence, minimizing both 
net length and bridges/overlap is contradictory. Therefore, 
for dense circuits, net length minimization should be favored 
over minimizing bridges/overlap because routing space is 
limited, as opposed to sparse circuits, where minimization of 
bridges/overlap may be favored over net length minimization. 
Thus, p is directly proportional to the sparsity of a circuit. Wc 
define the sparsity of a circuit as 

3 = 1 - ^ (9) 

where A n denotes the amount of space necessary to lay down 
all nets as estimated by the global router and A r denotes 
the amount of free routing space after placement. Notice that 
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pig. 5. Determination of cost using the new cost function. When minimizing 
bridges, variant 2 is preferred to variant 1 



Fig. 4. Grid model. 

maximally sparse circuits have s = 1 and maximally dense 
circuits have s = 0. Obviously, s < 0 indicates circuits that are 
not routable. 

Since p is a weight factor, it depends on the actual values 
occurring in the conventional cost function C. As we will 
show in the next section, we can derive a weight factor a to 
take into account this dependency. Thus, we may write p as 

p = sa. (10) 

V. Incorporating Routing Style 
We assume that the routing space is modelled as a 3-D grid 
graph G = (V, E). An edge e e E of the grid graph may 
have one of three directions, called x- t y-, and v-directions, 
as indicated in Fig. 4(a). Vias are represented by edges in the 
u-direction. Wires are allowed to run over edges and bend at 
grid points. An edge e 6 E of the grid graph is said to be 
active if it is part of a wiring pattern. Edges that are not part 
of a wiring pattern are called inactive. The status of an edge 
may be changed from inactive to active by the router. Possibly, 
initial wiring patterns exist in the grid graph. 

As mentioned in the introduction, we distinguish three 
aspects that may affect the edge cost function A(e). To cover 
these aspects, we assume that for each layer z, three costs are 
specified, namely cf ,c?, and c?. Here, cf denotes the cost of 
edges in the .x-direction, c\ denotes the cost of edges in the 
y-direction, and c\ denotes the cost of vias connecting layer 
i and i + 1. 

Let li = If + /f denote the total number of edges in layer i 
for a net, where If and 1? denote the number of edges in the 
x- and ^-directions, respectively. Furthermore, let Vi denote 
the number of vias connecting layers i and t + 1. Then, we 
may write (7) as 

Since the failure cost function is specific to some material, we 
assume that for each layer i a failure cost function according 
to (6) is specified, i.e., F = cv;k + ft/» + 7*«» + <W Tnen ' 
combining the conventional cost function of (11) with the 
failure cost function according to (8) yields 

+W + Pili)vi + PiOtibi + pibiOi. (12) 

Since the cost of vias are not influenced by any existing wiring 
pattern, we may discard vias from the following discussion, 
and set the cost of a via in layer i to v% + pm. 



Assume that /; is specified for each layer according to pi = 
sui. Furthermore, assume that the circuit is maximally sparse, 
i.e., s = 1, implying that p { = a*. Since the circuit is maximally 
sparse, we want to minimize bridges/overlap. 

In Fig. 5, a net N exists in the routing space. Connecting 
point A and point B y we want the net to follow variant 2 
instead of 1, because the critical area for bridges is minimal 
for variant 2. Using (12) and assuming the length of the net 
in the x-direction is given by l T , the cost of both variants are 
given by 

ft =(«? + (TiPi)h + OiOtiW 

C 2 = (<? + <Ti(1i)lT + 2(r:V + *&) 

using b = l T for variant I and h = 0 for variant 2. Since we 
prefer variant 2 to variant 1, we demand that Ci > C 2 and 
derive a lower bound for o if i.e., 



2(* 



(13) 



(14) 



Similarly, for vertical wires we derive 

2cf 

ffi> a i l T -2fl t ' 

For overlap we may derive the same functions, only substi 
tuting f>i for a,-, i.e., 

2c? . 2c" 



— and Oi > 



(15) 



Mr - 2ft 

Combining (I3H15), and setting (T { to the maximum lower 
bound yields 

2 max(cf, tf ) 



<7i = 



/jmiiifoj,^) — 2ft 



(16) 



As can be seen from (16), <x t - depends on both the cost 
information per layer and the failure parameters specific to 
each layer. Parameter h may be seen as a threshold net length. 
If the length by which two nets are in parallel (or overlap) 
exceeds this threshold, we demand that one of the nets will 
take a detour as shown in Fig. 5. 

VI. Computing New Edge Cost 
A procedure is given in this section to determine the final 
cost of an edge (see Algorithm 1); to be able to do this, 
the notion of surrounding edges is introduced. For each edge 
e 6 E in the x- or ^-directions, four surrounding edges are 
identified, denoted as 6*. 6 r , o„, and o { as indicated in Fig. 4(b). 
The edges b t and b r , lying in the same layer as edge «, form 
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the possible bridging edges, while o u and o ly lying in the upper 
and lower layers, respectively, form possible overlap edges. 

procedure determine.edge_cost (i.dir) 

begin 

ifdir = v then A := c}' + pm\ 
else 

A := cf r + Pi (h\ 
if 6/ is active then A := A + 
if b r is active then A := A + pi<ii\ 
if o u is active then A := A + p{6i\ 
if at is active then A := A + ftfij-i; 
fi; 

return A; 
end 

A. Algorithm J: Determination of the New Edge Cost 

In the above procedure, i is the index of the layer in which 
edge e lies, and dir denotes the direction of the edge, being 
either x, y, or v. No special actions are taken for vias; if the 
edge represents a via from layer i to layer i + 1, the cost 
A is set to c- ; + piii. The final cost is influenced by their 
surrounding active edges for all other edges. The edge e. is 
assigned the original cost cf r -f plus a cost for each 
active surrounding edge. The latter depends on the relative 
position of the surrounding edge with respect to edge e. It is 
easy to see that this procedure assigns the original edge cost 
if 5 = 0, implying p, = 0, for all i. 

The final edge cost depends entirely on the active edges 
by which it is surrounded, and therefore may change during 
routing. To avoid changes in cost due to interaction with 
already routed segments of the same net, it is assumed that 
an edge is activated only after all terminals of a net are 
connected. Notice that the above procedure takes constant time 
to determine the cost of an edge. Therefore, the run time 
complexity of the original maze router is not influenced by 
this new cost function. 

VII. Experiments 

The routing approach in which the layout failure mechanism 
is taken into account has been implemented in the GAS sea of 
gates layout system [11], using the multiterminal maze router 
of [3]. To test the real effect of our new routing strategy, 
20 different circuits have been laid out. Except for mult8 
and primes9, all circuits are taken from the MCNC '91 logic 
synthesis benchmark set. The scales of the layouts range from 
150 to 5000 transistors, while their numbers of nets range 
from 100 to 3500. After placement is finished for each circuit, 
the sparsity of a layout can be obtained according to (9). 
Basic information about the benchmark layouts as well as their 
sparsities is shown in Table I. 

All benchmark circuits are routed using the detailed router 
of the GAS system. To compare the results achieved by the 
newly proposed cost function, each circuit is laid out twice, 
once using the conventional routing cost function and once 
using the new routing cost function. Routing is performed 
using three layers: a polysilicon layer />j, and two metal layers 
in and ins. The original edge costs are set according to c£ a = 
20, = 3, < n = 3,< 4 = 10, r.f nfl = 8, and <» ns = 2, 
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imposing a vertical-horizontal-vertical (VHV) routing style. 
Without loss of generality, the parameters ft, 0 t 7, 8 are set 
to 1, and It is set to 7. Consequently, for each of the three 
layers o can be obtained, i.e., <r ps = 8, = 4, and a; u# = 3. 

The run times are presented for both runs of the router, 
respectively, using the conventional routing cost function and 
the new routing cost function (Table 1). Experiments were 
done on a HP735. On average run time increases by 21.4%. 
It is clear that this increase in run time comes from the 
determination of the edge cost during routing. However, this 
determination still takes constant time, and thus the complexity 
of the routing algorithm is not changed. 

The EDAM system [13] is used to obtain data concerning 
the failure probability of both layouts. According to (1), the 
failure probability of a layout largely relies on the values 
of parameters and Ao, which are process-environment 
dependent. Thus, probability computation will not make sense 
without accurate values for these parameters. In this paper, 
we make the simplification of computing the layout sensitivity 
instead of the failure probability, because it is believed that a 
low layout sensitivity implies a small failure probability. A 
defect size of 4 is chosen to evaluate the critical areas 
of the benchmark layouts, whose feature size is scaled down 
to 2 /xm. Therefore, the defect size is large enough to reflect 
meaningful layout sensitivities. 

The critical areas with respect to the four types of faults 
are computed for each circuit. The changes in the critical 
areas as well as the layout sensitivities (ALS) are presented 
in Table II. From the data, it may be concluded that for all 
benchmark layouts the critical areas for one layer bridge faults 
(type OE) decrease 22.6% on average, while the critical areas 
with respect to one layer open faults only increase 2.5% on 
average. The critical areas for inter-layer faults, i.e., type IE 
faults and type 1M faults, change very slightly. For the IE 
faults, this is because via-minimization is already considered 
in the original cost function. Therefore, the number of vias will 
slightly increase since the weight of a via is relatively small in 
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Analysis Results 
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Fig. 6. Change in layout sensitivity per circuit. 

the new cost function. The changes of the fault IM seem to be 
random. The reasons are as follows: 1 ) In the original routing 
module, the VHV routing style is chosen. Therefore, large 
area overlap between two metal layers is already prevented 
by the design style. Obviously, the critical areas caused by the 
overlaps will not be decreased significantly by putting an extra 
penalty on them. 2) Since in some cases an increase in overlap 
between two layers will result in a final decrement of the total 
cost, it is also possible that the critical areas with respect to 
Lhese faults will increase. 

The total effect of the new routing strategy is shown in 
the last column in Table II. According to the data, we find 
thai the layout sensitivities can be decreased 6.4% on average, 
if the failure probability is taken into account in the routing 
procedure. 

Fig. 6 shows the sensitivities of the two different layouts per 
design: the white bars represent the sensitivities of the layouts 
made by the original router and the black bars indicate the 
sensitivities of the layouts made by the new routing module. 

To give an indication of the effect the new cost function 
on layout, a snapshot is taken from the layout of benchmark 
circuit apla. Fig. 7 shows the layout obtained using the con- 
ventional cost function and Fig. 8 shows the layout after the 




Fig. 7. Snapshot of layout of circuit 'apla' using conventional routing cost 
function. 




Fig. 8. Snapshot of layout of circuit 'apla' using the new routing cost 
function. 



new cost function is used. Clearly, the wiring on all three 
routing layers is spread more uniform over the available area, 
and the amount of overlap between wires on different layers 
is less in Fig. 8. Both the number of vias and the net length 
increase slightly for Fig. 8. 

VIII. Conclusions 

A novel routing strategy producing layouts that are less 
susceptible to spot defects has been presented. Based on an 
analysis of spot defects, the four types of the main random 
disturbance in IC processes are modelled. A formula indicating 
the failure probabilities of these faults is derived. Combining 
the failure cost function with the conventional cost function, a 
new cost function for the general routing problem is devised. 
By using this new cost function, a good tradeoff between the 
minimization of the total net length and the maximization 
of the manufacturability of a layout can be obtained. The 
experimental data show that the layout sensitivities can be sig- 
nificantly decreased by the proposed routing approach even for 
very dense circuits, while the layout areas are kept the same. 
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Partitioning Logic to Optimize Routability on Graph Structures 



Gopalakrishnan Vijayan 

IBM Research Division 
T. J. Watson Research Center 
Yorktown Heights, NY 10598 

1. Introduction 

We study the problem of partitioning the nodes or a logic 
network (i.e., a hypergraph) on to the vertices of a parti- 
tion graph G, in which the cost function to be minimized 
is the cost of global routing, i.e., the cost of routing the 
nets of the logic on the edges of the graph G. Each 
vertex of the partition graph has a given upper bound 
on the number of nodes of the logic that can be assigned 
to the vertex. The nets of the logic network and the 
edges of the partition graph may have weights associated 
with them, which appear as multiplicative factors in the 
routing cost function. We refer to this partitioning 
problem as the Min-Cost Partitioning on a Graph 
(MCPG) problem. 

The MCPG model generalizes many different formu- 
lations of logic partitioning [1,3,4,6,8,9]. For example the 
classical min-cut problem of [1,3,4] reduces to the case 
where the partition graph G is just a single edge (see 
Figure I). The MCPG model is very general and can 
be applied in many partitioning situations arising in 
VLSI physical design. We describe two such applica- 
tions: 

Residual Logic Partitioning: In many practical chip de- 
signs there are usually a set of macros such as memory 
blocks or data-path stacks, which have to be pre-placed 
due to timing, I/O proximity, power/ground bussing or 
other considerations. The residual area on the chip may 
be highly non-rectangular and contain many bottlenecks 
for wiring. This residual area has to be populated with 
the residual logic (usually control logic). A suitable ap- 
proach is to dissect the residual area into rectangular 
regions, partition the residual logic on to these regions, 
and then perform placement within each region. The 
MCPG model can be used in the partitioning step in 
order to optimize the number of nets that cross the cuts 
separating the rectangular regions. The partition graph 
to be used will be the planar dual of the rectangular 
dissection. 
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Figure 2. Partition Graph (the dual) of a Floorplan 
Partitioning after Early Floorplanning: In timing-critical 
VLSI designs, it is desirable and many times imperative, 
to influence the logic design process using physical de- 
sign constraints [2]. One such method is to do early 
floorplanning using the functional blocks and their 
connectivity, before the synthesis of the logic within the 
functional blocks. The floorplan can then be used to 
drive the logic synthesis process, so as to minimize delays 
on critical paths that traverse across many blocks of the 
floorplan. The synthesis of the functional blocks actu- 
ally gives a initial and rough partition of the logic on to 
the blocks of the floorplan, which satisfies the liming 
constraints. Global wirability can then be improved by 
perturbing the functional partition. The MCPG model 
can be used to achieve this by re-mapping the logic on 
to a partition graph which is the the planar dual of the 
floorplan (Figure 2). 

We now briefly describe a strategy for the modeling and 
solution of the MCPG problem. Given a partition, the 
feasibility and cost evaluation of global routing on the 
partition graph can be modeled as a multicommodity 
flow problem [7], which in turn has an equivalent Linear 
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Programming (LP) formulation. The LP cosi function 
is the weighted sum of flows on the edges of the partition 
graph, which is the global routing cost. In order to find 
a partition that minimizes this cost, the LP model has 
to be invoked and solved repeatedly within the frame- 
work of a partitioning algorithm which maps logic on 
to the vertices of a graph. However, the LP model is too 
large, and its solution too slow, to be used within such 
a partitioning program. Fortunately, there is a necessary 
condition for feasible multicommodity flow called the cut 
condition [5], which can be used to derive a suitable net 
cost model for the nets. The net cost model can be used 
to drive any partitioning heuristic, such as an iterative 
improvement heuristic for multiway partitioning [9] or 
a randomized heuristic. The partition obtained using 
the heuristic is verified for feasibility of global routing 
on the partition graph using the LP model. If feasibility 
is not attained, the weights on the edges of the partition 
graph are modified appropriately to reflect the new 
criticalities of the edges. This results in a change in the 
parameters in the cost function used by the partitioning 
heuristic. The partitioning heuristic is reapplied, and 
this process is repeated until a feasible global routing is 
attained. The LP procedure is thus pushed to the outer 
loop of the overall partitioning strategy, with a cost 
model given by the cut condition being used within the 
inner loop. 

2. A Net Cost Model 

Given a graph <7 = (K,£) and a susbet 5£K, we denote 
by G/S the subgraph of G which is induced by the subset 
,S, We denote the complement V- S of the set S with 
respect to V as S A cut of the graph G, denoted as 
iS,S) is defined by any subset Define 
connj:uts{G) to be the collection of all cuts (S,S) such 
that G/S and G/S are both connected subgraphs, fn all 
references to a cut (5, S) in the following discussion, we 
assume that (5, S) e connjcuis{G). 

Given a set 5 <= connjculs(G) y we define 
cross(S,S) = {(iJ)e E\i<= SJeS) , which is the set of 
edges of G that go between S and S. The total capacity 
of the cut (5, S) is given by c(S, S)= X c e , 
where c e denotes the capacity of edge e. eecroAS ( s i $) 



Given a graph G for the multicommodity flow problem, 
and a cut (5,5), define J[S, S) to be the total amount 
of flow that must cross this cut. 

The cut condition [5] is defined to be 
AS, S) £ c<5, 5) ; VS e conn_cuts(G) .. 
From its definition, it is clear that the cut cotuiition is a 
necessary condition for the existence of a feasible multi- 
commodity flow. 

We now describe a suitable cost model for nets that is 
inspired by the cut condition. Let L = (£/,//) be a logic 
network, where U is its set of nodes and H its collection 
of nets. Consider a partition M of the node set U on to 
the vertex set V of the partition graph G. Let M(u) de- 
note the the vertex of V containing the node u under the 
partition M For a net kW, let 

M(h) = { M(u) \ush) y i.e., each vertex in M{h) contains 
at least one node of the net h. The net h is said to con- 
tribute to a cut (5, S\ if M(h)[}S ± </> and 
M(h)f\S ^ <j) . Tn other words, the cut (S,5) cuts the 
span of the net h, and the net h must flow across the cut 
(5, S). Define the set spanj:uts(M{h)) to be the collection 
of cuts (5, S) e conn_cuts(G) such that the net h contrib- 
utes to the cut (5, 5), i.e., 
span_cuts{M{h)) = 

{{S.S)zconnj:uts{G)\M{h)r\S * </>,M(h)f)S * <f>\ 
A suitable cost that the cut condition suggests for the 
net h under a given partition M is the cardinality of the 
set span_cuts{M(h)). Any reduction in the number of 
these contributed cuts for a net is a positive step towards 
the satisfaction of the cut condition. However, this cost 
model does not take into account the weights of the 
edges belonging to a cut, nor the weight of the net itself. 
Let w e denote the weight assigned to an edge e of the 
partition graph. Let us examine how to assign a suitable 
weight to a cut (5, S) using the weights of the edges in 
crossiS, S) . To take a simple example, let there be just 
two edges with equal weight 10 in cross(S y 5). Intuitively, 
the weight of the cut should be less than the weight of 
both the edges, because two edges are better than any 
one edge as far as flow across cut is considered. The 
harmonic mean 1/(1/10 + 1/10) = 5 has this desired 
property, and is a good measure for the weight of a cut. 
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Generalizing, we define the weight of a cut (S, S) as 

w{s,s) - i/( £ JK)- 

Let w h denote the weight of a net h. The cost of the net 
A, under a partition M, denoted as net_cost{h) can be 
defined as 

net cost(h) « w A x X w(S, 5) • 

5e5/7an_cur.v(M(/i)) 

Note that when all the nodes belonging to a net reside 
in the same vertex of G, i.e., when | M(h)\ = 1, we have 
span_cuts{M{h)) = <t>, and therefore net_cost{h) * 0. Such 
a local net does not contribute to the cost of global 
routing. 

The cost or a partition M is defined to be 
cost(M) = I net cost{h) , i.e., the sum of the costs 

of the nets of the logic network L. 

An Example: 

Consider the partition graph shown in Figure 3. Let the 
weights of all 5 edges be equal to I. Let h { he a net 




Figure 3. A partition graph 

whose spanning set is M(h{) = (1,3). Let /i 2 be a net 
whose spanning set is AV(/t 2 ) = (2,4}. 

spanjuts{M{h x )) = ( ({1.2,4], {3}), 41}, {2,3,4}), 
({1,2}.{3,4}), ((l,4},[2,3))J 

crwjaiA4|,PJ) = l(4,3),(2,3)} ; 

w({l,2,4},{3}) = l/(l + I)=l/2 

cmjai},{2.3,4}) « {(U),(l,4)} ; 

w({l},{2,3,4)) = l/(| + l)=l/2 

crw»({l, 21,(3,4}) = ((l,4),(2,4),(2,3)} ; 

w([\aWA\) = 1/(1 + 1 + 0=1/3 

c^.<(i,4},{2,3)) = {(i,2),(4,2),(4,3)} ; 

h<{1,4},{2,3}) = 1/(1 + 1+0=1/3 



netjcosKhi) = «/2+ 1/2+1/3+1/3 - 5/3 

Similarly, we can derive net_cosi{hrj) = 4/3 
Intuitively, one can observe that there are more routes 
for the net h 2 than for the net h [y and thus h 2 has a lower 
cost. 

3. MCPG Partitioning Framework 

Given a logic network L and a partition graph G, our 
framework for the MCPG solution is as follows: 

1. Select an initial partition M of the nodes of L on to 
the vertices of G. 

2. Select an appropriate initial weight w e for each edge 
e of G. 

3. Repeat the following steps 

a. Execute a partitioning heuristic, which uses 
cost(M) as its cost function, and improve the 
partition M. 

b. Formulate the global routing LP model for the 
partition M on the graph G, and check if the LP 
is feasible. 

c. If the LP is infeasible, then 

Appropriately adjust the weight w e of each edge 

e of G. 
until the LP is feasible 
Thus the LP model is thus pushed to the outer loop of 
the overall partitioning strategy, with a faster heuristic 
using the net cost model being used within the inner 
loop. 

The Partitioning Heuristic: The partitioning heuristic of 
step 3(a) may be any suitable heuristic for the MCPG 
problem. One such multiway partitioning heuristic is 
described in [9]. 

Adjustment of the weights w e : Edges with smaller capac- 
ity are bottlenecks for global routing. Flows on these 
edges should have a higher penalty associated with them. 
Therefore the initial weight w e of an edge e may be^set 
inversely proportional to its capacity c e , as "> c =— , 
where C is a suitable constant. The readjustment of the 
weights in step 3(c) can be done as follows: Temporarily 
set the capacity of each edge to oo, and solve the LP 
again, using the LP cost function, which is the weighted 
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sum of flows on the edges of the partition graph. Note 
(hat (his modified LP is always feasible, Let f e be the 
flow of nets on the edge e in the LP solution. The dif- 
ference f e - c c is a measure of the flow criticality of the 
edge e. A positive difference for an edge e implies that 
there is an overflow on the edge, and that its weight must 
be set higher. A negative difference implies that the edge 
is not critical with respect to global routing, and its 
weight may be lowered. The new weight w e can be set 
as w e (new) = w e {old} + C'x(f r c e ) , where C is an- 
other suitable positive constant. 

Number or repetitions: In practice, we repeat step 3, five 
times, irrespective of whether the LP becomes feasible 
or nol. Even if the LP becomes feasible after the first 
execution of the partitioning heuristic, it may be possible 
to improve the parti Lion, by readjusting the weights and 
repeating this process four more times. If the LP does 
not become feasible after five repetitions, it is up to the 
designer to decide whether this implies an inherent in- 
feasibility or whether the process should be repeated 
with a modified partition graph. This crucial decision 
depends largely on the application. For example, in the 
early floorplanning application, the designer may try a 
different floorplan of the same set of functional blocks, 
which also satisfies the timing and other considerations 
used in the early floorplanning process. Alternately, the 
designer may decide to try a different functional parti- 
tioning. 

4. Conclusions 

The MCPG framework employing an iterative improve- 
ment partitioning heuristic was implemented and used 
successfully on several real designs in a variety of differ- 
ent applications. To give one example, the partitioning 
system was used in the design of a dense CMOS chip 
which had 23 big predesigned macros, with the residual 
logic having more than 1 1 000 logic nodes and about 
12000 nets. The application here was a mixture of resi- 
dual logic partitioning and partitioning after early 
floorplanning. With the use of the MCPG framework, 
the average net length was reduced by about 20%, and 
the number of nets that could not be routed by an au- 
tomatic routing program was reduced by about 60%. 
Similar results were observed for other chip designs 
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which required the use of the general partition model. 
One important benefit of the MCPG model is that it 
gives a practical approach to addressing the global 
routing problem very early in the design process, thus 
speeding up the design cycle time. 
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